JUst CONcatenation - A Corpus-based Approach and its Limits
نویسنده
چکیده
This paper describes a radical corpus-based approach to speech synthesis. No signal manipulation is performed and the synthesis becomes a mere concatenation. The feasibility of this approach is evaluated regarding corpus selection constraints and realization of different prominence patterns. A “traditional” concatenative system serves as a baseline. The results indicate that the size of the corpus must be rather large in order to obtain satisfying and reliable results for unlimited text-to-speech conversion.
منابع مشابه
A close look into the probabilistic concatenation model for corpus-based speech synthesis
We have proposed a novel probabilistic approach to concatenation modeling for corpus-based speech synthesis, where the goodness of concatenation for a unit is modeled using a conditional Gaussian probability density whose mean is defined as a linear transform of the feature vector from the previous unit. This approach has shown its effectiveness through a subjective listening test. In this pape...
متن کاملOn the Suitability of Vocalic Sandwiches in a Corpus-Based TTS Engine
Unit selection speech synthesis systems generally rely on target and concatenation costs for selecting the best unit sequence. The role of the concatenation cost is to insure that joining two voice segments will not cause any acoustic artefact to appear. For this task, acoustic distances (MFCC, F0) are typically used but in many cases, this is not enough to prevent concatenation artefacts. Amon...
متن کاملA probabilistic approach to unit selection for corpus-based speech synthesis
In this paper, we present a novel statistical approach to corpus-based speech synthesis. Unit selection is directed by probabilistic models for F0 contour, duration, and spectral characteristics of the synthesis units. The F0 targets for units are modeled by statistical additive models, and duration targets are modeled by regression trees. Spectral targets for a unit is modeled by Gaussian mixt...
متن کاملForward Masking Phenomenon in Concatenative Speech Synthesis
The approach described in the paper tries to get more knowledge to the concatenative text-to-speech system design. The knowledge is based on masking phenomenon of the inner ear, particularly of its temporal (forward) masking properties. Designing such knowledge-based system is suggested to use in the unit selection-based speech synthesis, as contemporary a prominent technique in concatenative s...
متن کاملCorpus Design for Malay Corpus-based Speech Synthesis System
Problem statement: Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesis speech sound. Approach: This study proposes a corpus design for Malay corpus-based speech synthesis system. This includes the study of design criteria in corpus-based speech synthesis, Malay corpus based database design and the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998